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Q^ ■ Abstract 

In this paper, we propose a general method for testing composite hypotheses. Our idea is 
to use confidence limits to define stopping and decision rules. The requirements of operating 
characteristic function can be satisfied by adjusting the coefficients of the confidence limits. 
For common distributions, such adjustment can be done via efficient computation by making 
use of the monotonicity of the associated operating characteristic function. We show that the 
problem of testing multiple hypotheses can be cast into the general framework of constructing 
sequential random intervals with prescribed coverage probabilities. We propose an inclusion 
principle for constructing multistage testing plans. It is demonstrated that our proposed 
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OO . testing plans can be substantially more efficient than the sequential probability ratio test and 






its variations. We apply our general methodology to develop an exact approach for testing 



^^ ■ hypotheses regarding the difference of two binomial proportions. 

o: 

O ■ 1 Introduction 

A fundamental issue of statistical inference is to test the parameter of a distribution. Let X be a 
p\ ' random variable defined in a probability space (il,^,Pr). Assume that the distribution of X is 

C^ I determined by a parameter € 0, where is the parameter space. A frequent problem is to test 

hypotheses r^ ■ < 9q versus J^ ■ 9 > Oi based on observations Xi, X2, • • • of X, where 6q < 9i 
are two parametric values in G. To control the risk of making wrong decisions, it is typically 
required that 

Pr{Reject J^o \ 9} < a, V6' G 9 such that 9 < 9o (1) 

Pr{ Accept J^ I 0} < /3, V^ € 6 such that > 0i, (2) 
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where a, /3 G (0, 1). The probabihty PrjAccept J^ \ 9} for 6 £ Q is referred to as the Operating 
Characteristic (OC) function. Since there is no requirement imposed on (^Oi^i) for the OC 
function, such interval {9o,9i) is referred to as an indifference zone. 

A famous solution to the above problem is the Sequential Probability Ratio Test (SPRT) 
proposed by Abraham Wald [TH] during World War II of last century. Various modifications of 
SPRT have also been proposed (see, |13j and the references therein). A major drawback of SPRT 
and its variations is that the test plans are efficient only for some parametric values. Another 
unpleasant feature is that the sample number of the tests may not be bounded. In this paper, to 
overcome the limitations of SPRT and its variations, we take a different path. Instead of using 
likelihood ratio as existing testing plans, we use confidence limits to define stopping rules and 
decision rules. A key idea is to compare confidence limits with the endpoint of indifference zone. 
We discovered that by adjusting the coefficients of the confidence limits, it is possible to control 
the risk of making erroneous decisions. We demonstrate that, for testing problems related to 
common distributions, the adjustment can be done by efficient computation and for such purpose 
it suffices to evaluate the OC function for the endpoints of the indifference zone. 

The remainder of this paper is organized as follows. In Section 2, we proposed our general 
principle of hypothesis testing in a sequential framework. In Section 3, we discuss the applications 
of the general principle to test the expectation of a random variable. Our development is motivated 
by the fact that the means of many important random variables are exactly the parameters of 
their distributions. In Section 4, we derived multistage testing plans from the fully sequential 
testing plans. In Section 5, we propose to use confidence limits to design multistage plans for 
testing multiple hypotheses. In Section 6, we apply the design method based on confidence limits 
to test hypotheses regarding functions of two binomial proportions. Section 7 is the conclusion. 
All proofs of theorems are given in Appendices. 

Throughout this paper, we shall use the following notations. The notation denotes an empty 
set. The set of real numbers is denoted by M. The set of positive integers is denoted by N. The 
expectation of a random variable is denoted by E[.]. We use the notation Pr{. | 9} to denote the 
probability of an event which is defined in terms of random variables parameterized by 9. The 
parameter 9 in Pr{. | 9} may be dropped whenever this can be done without introducing confusion. 
If Z is parameterized by 9, we denote Pr{Z < z \ 9} hy Fz{z,9) and Pr{Z > z \ 9} hy Gz{z,9) 
respectively. The support of a random variable Z is denoted by Iz, i.e., Iz = {Z{(jj) : w G Q}. 
The standard normal distribution is denoted by $(.). The other notations will be made clear as 
we proceed. 

2 Fully Sequential Tests of One-sided Hypotheses 

In this section, we shall consider the problem of testing hypotheses J^ '■ 9 <9q versus Mi : 9 > 6i 
stated in the introduction. 



Let ^{Xi, • • • , X„; 5) and '^{Xi, • • • , X„; 5) be confidence limits such that 

Pt{^{Xi,--- ,Xn;6) <e\9}>l-6, Pr{^(Xi, • • • ,X„;5) > ^ [ 6)} > 1 - 5 (3) 

for 6 G (0, 1) and n = 1, 2, • • • . Let C be a positive number less than min( — , ^). The stopping rule 
of our testing plan is defined as follows: 

Continue sampling until ^{Xi, ■ ■ ■ , X^, Co) > ^o or '^{Xi, • • • , X„; (f3) < 6i for some n. 

Upon termination of sampling, the decision rule of our testing plan is defined as follows: 

(i): If ^0 ^ -^{^i, ■ • ■ ) ^n', (oi) < '^{Xi, • • • , Xn] C/3) ^ Gi is not satisfied, then accept J^q when 
'^(Xi,--- ,X„;C/3) < 6li and reject ^ when^(Xi,--- ,X„;Ca) > 6*0. 

(ii): If 9q < ^{Xi, • • • , Xn] C") < '^(^i, ■ ■ ■ , Xn] Cl^) < ^i is satisfied, then accept J^ or reject 

Mq based on an arbitrarily pre-specified policy. On possible way to specify such policy is to 

accept Mq if 

/(Xi,--- ,X„;go) ^a 

f{Xi, ■ ■ ■ ,Xn;9i) (3 
and reject ^y^ otherwise. 

Here /(xi, • • • ,Xn',0) denotes the joint probability density function (for the continuous case) 
or the joint probability mass function (for the discrete case) of Xi, ■ ■ ■ ,Xn parameterized by 9. 

Throughout the paper, let n denote the number of samples at the termination of sampling. 
Clearly, if '^(Xi, • • • , X„; C/3) — ^(Xi, ■ ■ ■ , X„; (a) converges to in probability as n — )• oo, then 
Pr{n < oo \ 9} = 1. As will be seen in the next section, for a wide class of hypothesis testing 
problems, we can show that the sample sizes are absolutely bounded. Moreover, we can show that 
the probabilities of committing decision errors for the above testing plan can be adjusted below 
any prescribed level by choosing a sufRciently small value of C > 0. 

3 Testing the Expectation of a Random Variable 

In many situations, the parameter to be tested is equal to the expectation of the associated 
random variable. That is, 9 = E[X]. As proposed in the last section, our testing plan is defined 
in terms of confidence limits. In order to obtain confidence limits and establish the monotonicity 
of the OC function for the associated testing plan, we need to introduce the concept of unimodal- 
likelihood estimator (ULE) . For a random tuple Xi , • • • , Xj. (of deterministic or random length r) 
parameterized by 9, we say that the estimator (p{Xi, • • • , Xr) is a ULE of 9 if (p is a multivariate 
function such that, for any observation [xi, • • • , Xr) of (Xi, • • • , Xr), the likelihood function is non- 
decreasing with respect to 9 no greater than (p{xi, ■ ■ ■ ,Xr) and is non-increasing with respect to 
9 no less than (p{xi, ■ ■ ■ , Xr)- For discrete random variables Xi, • • • , X^, the associated likelihood 
function is the joint probability mass function Pr{Xj = Xi, i = I,--- ,r \ 9}. For continuous 
random variables Xi,--- ,Xr, the corresponding likelihood function is, fxi,- ,Xrixir " TXr,9), 
the joint probability density function of random variable Xi, • • • ,Xr. It should be noted that a 



ULE may not be a maximum-likelihood estimator (MLE). On the other side, a MLE may not be 
aULE. 

In this section, we shall focus on random variables such that their sample means are ULEs. 
Let Xn = \^ — -■, where Xi, X2, • • • are i.i.d. samples of X parameterized hy 9 = E[X]. Define 

Fj^jz,e) = Pr{Xn<z\e}, G^^{z,e) = Pr {x„ > z 1 0} . 

Suppose that {0 G G : F^ (z, 6) < 5} and |6' G : Gj^ {z, 9) < 5} are non-empty for z G J-^ , n € 
N, (5g (0, 1). Define 

L{z, 6) = max {9 £ e -.Gj^Jz, 6) < 5}, U {z, 6) = mm {9 e Q : F-j^Jz, 9) < 6} . (4) 

Therefore, ([3|) is satisfied if we define 

^(Xi,--- ,X„;5) = L(X„,,5), ^{X,,--- ,Xn;6) = U(Xn,6). (5) 

Suppose that the moment generating function ^{p, 9) = E[e''"^] of X exists for any p G M. 
Define 'W{z,9) = infpg]Re~''^E[e'' ]. For testing plans described in Section 2 with confidence 
limits defined by (JH) and ([5]), we have the following results. 

Theorem 1 Suppose that the sample mean Xn = — - is a ULE of 9. Then, 

\ ln(C«) ln(C/3) 1 

n < max 



ln^(^,0o)' ln^(^,ei)j" 

Moreover, Vi{Accept Mq \ 9} is non-increasing with respect to 9 £ Q such that 9 ^ {9q,9i). 
Furthermore, both Pr{Reject J% \ 9q} and Vi{Accept J^q \ 9i} tend to as C — )■ 0. 

See Appendix [B] for a proof. As can be seen from Theorem [U the risk requirements ([T]) and (I2D 
can be satisfied by choosing a sufficiently small (^ > 0. For this purpose, it suffices to ensure that 
([1]) and ([2]) hold for the endpoints of the indifference zone. If X is a Bernoulli random variable, 
then the confidence limits are exactly the same as Clopper- Pearson [11] . If X is a Poisson random 
variable, then the confidence limits are exactly the same as [12]. It should be noted that in these 
particular cases, recursive algorithms can be developed for the evaluation of the OC functions and 
consequently facilitate the determination of <^ satisfying ([1]) and dlj). 

Note that confidence limits (j3|) and ([5|) are directly derived from the exact tail probabilities 
of Xn- In many cases, simple bounds for the tail probabilities of X„ are available. Hence, it 
is reasonable to hope that the stopping boundary of the testing plans may be simpler if we use 
confidence limits obtained from the bounds of the tail probabilities of Xn- Toward this goal, we 
have the following results regarding confidence limits. 

Theorem 2 Suppose that the sample mean Xn = ^"n — ' "^^ '^ ULE of 9. Suppose that {0 G G : 
[<r(z,6')]" < 5, 9 < z} and {9 e e ■- ['^{z,9)f < d, 9 > z} are nonempty for z G %„, n G N and 
5 e (0,1). Define 

£{z, 6) = max {0 G : [^(z, 9)^ < S, 9 < z} , U{z, 6) = min {^ G G : [^(2, 0)]" < 6, 9 > z} . 



Then, 

PT{C(Xn,S) <0}>l-S, PT{U(Xn,S) > 9} > 1 - 5, PT{C(Xn,S) kO < U(X,aJ)} > 1 - 25. 

Moreover, both C{z, 6) and U{z, 5) are non- decreasing with respect to z £ I^ ■ 

See Appendix [C] for a proof. For testing plans described in Section 2 with confidence limits 
defined by Theorem [21 we have the following results. 

Theorem 3 Suppose that the sample mean X„ = '"^ — - is a ULE of 6. Then, 

\ ln(C«) ln(C/3) 1 

n < max ' * 



ln^(^,0o)' ln<^(^,ei)j' 

Moreover, Fr{Accept J% \ 9} is non-increasing with respect to 9 £ @ such that 9 ^ (6*0,^1). 
Furthermore, both Y'T:{Reject Mq \ 9q} and Vx {Accept ^q \ 9i} tend to as C ^- 0. 

See Appendix [P] for a proof. Although our proposed testing plans are derived from confidence 
limits, we can avoid the computation of confidence limits by virtue of the monotonicity of the tail 
probabilities (or their bounds) of Xn with respect to 9. Under the assumption that the sample 
mean Xn = ^"^ — ' ^^ ^ ULE of 9, we have the following observations: 

(i): L(Xn, Ca) >9o^ Gj^^iXn, 9o) < (a and U(Xn, 0) < 9i ^ %„(^n, ^1) < C/3. 

(ii): C(Xn,Ca) > 9o ^ nXn,9o) < (a, Z„ > 9o and UiX^XP) < ^1 ^ '^(X„,^i) < 
C/3, Xn<9i. 

Here the notation "<^" means "if and only if" . 

4 Multistage Tests of One-sided Hypotheses 

In the preceding discussion, we have been focusing on fully sequential tests. In many applications, 
it may not be practical to take one sample at a time. It may be much more efficient to perform 
testing with samples in groups. This motivates us to adapt the sequential procedure proposed in 
Section 2 to multistage tests. 

Consider hypotheses J^o '■ 9 < 9q versus ^ : 9 > 9i as before. We shall test such hypotheses 
with a testing plan of s stages. Let C, he a. positive number less than min(— , -g). Let ni < 77-2 < 
• • • < n^ be sample sizes, where n^ is the sample size of the ^-th stage. Assume that the largest 
sample size Us is no less than the smallest integer n such that {^(Xi, • • • , X„; C,a) < 9q < 9i < 
'^{Xi,--- ,X„;C/3)} = 0. 

The stopping rule of our testing plan is defined as follows: 

Continue sampling until Jf(Xi,--- ,X„^;Ca) > 9q or '^{Xi,--- ,X„^;C/3) < 9i for some 
££{!,■■■ ,s}. 

Upon termination of sampling, the decision rule of our testing plan is defined as follows: 



(i): If ^0 < -^{Xi^--- ,Xrn'-,C,a) < '^{Xi,--- ,Xni;Cf^) ^ ^i is not satisfied, then accept J% 
wlien ^ {Xi , • • • , Xnf ; C/3) < 01 and reject .y^ when ^{Xi , • • • , X„^ ; Ca) > ^o- 

(ii): If Oq < ^{Xi,--- ,Xng;Ca) < ^(X\,--- ,X„^;C/3) < Q\ is satisfied, then accept J^ or 

reject ^^ based on an arbitrarily specified policy. On possible option of such policy is to 

accept ^Q if 

/(Xi, • • • ,X„^;go) a 

/(Xi,- • • ,Xn{,B\) ~ /3 
and reject =^ otherwise. 

Here /(xi, • • • , Xn^ ; ^) denotes the joint probability density function for the continuous case or 
the joint probability mass function for the discrete case. The confidence limits J£{X\ , • • • , X^ , C*^) 
and '^(Xi, • • • , X„^, C/3) can be defined by (jl]), (l5|) or Theorem [2j With regard to such plans for 
testing Q = ]E[X], we have the following results. 

Theorem 4 Suppose that the sample mean X„ = '"^ — - is a ULE of 6. Then, Vi {Accept J^q \ 
0} is non-increasing with respect to 6 ^ Q such that 9 ^ {Oq^Oi). Moreover, Vr{Reject J^q \ 6q} < 
s(a and Pr{Accept J^ \ 9i} < sQp. Furthermore, both Fi{Reject J^ | ^o} f^^^ Pr{Accept ^q \ 
d\) tend to as C ~^ 0- 

Theorem H] can be shown by similar methods as that of Theorems [1] and [3l 

5 Multistage Tests of Multiple Hypotheses 

In this section, we shall propose a unified approach for constructing multistage plans for testing 
multiple hypotheses by virtue of sequences of fixed-sample-size confidence intervals. The general 
problem to be considered is described as follows. 

Let X be a random variable defined in a probability space (O, ^, Pr). Suppose the distribution 
of X is determined by an unknown parameter in a parameter space 0. In many applications, 
it is desirable to infer the true value of Q from random samples X\,X2, ■ ■ ■ of X. This topic can 
be formulated as a general problem of testing m mutually exclusive and exhaustive composite 
hypotheses: 

JToi^eeo, jri:^Gei, ..., =^„_i : ^ G e^_i, (6) 

where Go = {^ G 6 : < ^i}, e,„_i = {^ G 6 : ^ > ^,-„_i} and 6^ = {0 G 6 : 0^ < ^ < 0,+i}, i = 
1, • • • , m — 2 with 01 < ^2 < ■ ■ ■ < ^m-i- To control the probabilities of making wrong decisions, 
it is typically required that, for pre-specified numbers bi G (0, 1), 

Pr{ Accept ^ | 6*} > 1 - J^, V6l G 0^, i = 0, 1, ■ ■ ■ , m - 1 (7) 

with 00 = {^ G eo : < e;}, 0™_i = {0 G G„_i : Q > C-il and 0i = {6 e 9, : 6'/ < 6 < 
^i+il, i = 1, • • • , m — 2, where 9'^, 6" are parametric values in © such that 9[ < 6i, 6'^_^ > 9m~i 



and Oi-i < e'l_^ < e[ < Oi < O'l < e'-_^-^ < Oi+i for f = 2, • • • , m - 2. For i = 0, 1, • • • , m - 1, 

PrjAccept J^ \ 0} is referred to as an Operating Characteristic (OC) function. Since there is no 

jm-i-a. 



requirement imposed for controlhng the risk of making wrong decisions for in G \ U"Lg 0j 



U™"^ {9'^,9'/), such a remainder set, U™1^ {9[^6'l), is referred to as an indifference zone. The 
concept of indifference zone was introduced by Wald [TJ] for two main reasons. First, when the 
parameter is close to ^j, the margin between adjacent parameter subsets Qi-i and Oj, it is 
immaterial to decide whether M^i-i or J^i should be accepted. Second, the sample size required 
to make a reliable decision between consecutive hypotheses J^_i and Mi becomes increasingly 
intolerable as 9 tends to 6i. Undoubtedly, the indifference zone should be sufficiently "narrow" 
so that the consequence of making erroneous decision is practically unimportant when lies in 
it. In general, a testing plan in our proposed framework consists of s stages. For ^ = 1, • • • , s, 
the number of available samples (i.e., sample size) of the £-th stage is denoted by n^. For the £-th 
stage, a decision variable D^ = ^^(Xi, • • • , Xm) is defined in terms of samples ^i, • • • , Xj^ such 
that Di assumes m + \ possible values 0, 1, • • • , rra with the following notion: 

(i) Sampling is continued until D^ ^ for some £ G {1, • • • , s}. 

(ii) The hypothesis Mj is accepted at the ^-th stage if Di = j + 1 and Di = for \ <i < I. 

For practical considerations, we shall only focus on sampling schemes which are closed in the 
sense that Y'r{Ds = 0} = 0. For efficiency, a sampling scheme should satisfy the condition that 
both Pr{Z)i 7^ 0} and Pr{Z)s_i = 0} are greater than zero. 

Let I denote the index of stage when the sampling is terminated. Then, the sample number 
when the sampling is terminated, denoted by n, is equal to nj. We shall focus on multistage 
sampling schemes which can be defined in terms of estimator 93^ = ip[X\,--- ,X„) such that 
if^ is a ULE of 6 for every re and that (^„ converges in probability to 9 in the sense that, for 
any e > and 5 G (0,1), Pr{|(^„ — 9\ > e] < 5 provided that n is sufficiently large. Such 
estimator (^„ is referred to as a Unimodal-likelihood Consistent Estimator (ULCE) of 9. Assume 
that, for any 6 G (0,1) and any positive number re, a lower confidence limit L{ip^,n^6) and 
an upper confidence limit U{(f^,n,5) can be constructed in terms of estimator (p^ such that 
{L{cpj^,n,6) < (fn < U{(Pn,n,6)} is a sure event and that 

Pt{9 < L{if^, n, 5)\9]< 6, Ft{9 > C/((^„, n,5)\9}<6 

for any 9 ^ Q. To construct testing plans satisfying the requirement ([7]), we propose to use 
fixed-sample-size confidence limits satisfying such assumptions. For the i-th stage, an estimator 
6i for 9 can be defined in terms of samples Xi,- • • ,X„^ as (f^^ = ip{Xi, ■ ■ ■ ,X„^). Accordingly, 
the decision variables Di can be defined in terms of estimator 61 = (p . The overall estimator 
for 9, denoted by 6, is equal to 61. 

Our general principle for constructing multistage testing plans and their properties can be 
described by Theorem [S] as follows. 

Theorem 5 Let Ui = 0(C) G (0, 1), f3i = 0{C) G (0,1) for i = I,- ■■ ,m - I, where 0{C) denotes 
functions of the same order as (. Let oq = ai, /3m = Pm-i and am = /3o = 0. Let 9'q = -co 



and 9'^ = oo. Let ni < n2 < ■ ■ ■ < n^ be sample sizes such that the largest sample size iig is no 
less than the smallest positive integer n guaranteeing that {0'^ < L{(p^,n,ai) < U{(p„,n, f3i) < 9'^} 
is not an impossible event for i = 1,- ■ ■ ,m — 1. Suppose that (p^ is a ULCE of 9 and that the 
decision variables Di are defined such that 

{Di = ?} C {9[_^ < L{di, m, ai-i) < U{de, ne, ft) < 0f }, l<i<m 

for i = 1,- ■ ■ ,s. The following statements (I)- (VI) hold true for m>2. 

(I) Fi{Reject J^ \ 9} is non- decreasing with respect to 9 ^ 0q. 

(II) Pr{Reject Jifm-i \ 9} is non-increasing with respect to 9 ^ Om-i- 

(III) Pi{Reject .y^i I 0} < s(niax{aj : i < j < m} + niax{/3j : < j < i}) for any 9 € 0i and 
i = 0,1,- ■ ■ ,m — 1. 

(IV) For < i < m — 1, Pr{Accept J^i \ 9} is no greater than son Oind is non- decreasing with 
respect to 9 ^ Q no greater than 9[. 

(V) For < i < m — 2, Pr{Accept J^ \ 9} is no greater than sft+i and is non-increasing with 
respect to 9 ^ Q no less than ^f+i- 

(VI) Assume that E[e'' ] exists for any p € M and that t^„ = ^^i — ' ^-^ '^'^ unbiased and 
unimodal-likelihood estimator of 9, where Xi,X2,- ■ ■ are i.i.d. samples of X. Then, for i = 
0, 1, • • • , m. — 1, lim^_>,o P^{Reject J^i \ 9} = for any 9 £ 0i. 

Moreover, the following statements (VII), (VIII) and (IX) hold true for m > 3. 
(VII) 

Pi{Reject J^i\9} < Pi{Reject ^i, 6 < a \ a} + Pr{Reject J^i,e>b\b}, 
Pi{Reject M'i\9}> Pr{Reject J^i, 6 < a \ b} + Pi{Reject J^i,d>b\a} 

for any 9 € [a, b] C ©j and 1 < i < m — 2. 

(VIII) Pi{Reject J% and .^rn-x \ ^} is non- decreasing with respect to 9 € 0q and is non- 
increasing with respect to 9 £ 0m-i ■ 

(IX) PT{Reject ,J^ and .^rn-x \ ^} is no greater than s x max{aj : 1 < i < m — 2} for 9 £ 0q 
and is no greater than s x max{/?j : 2 < i < m — 1} for 9 € 0m~i ■ 

Theorem [5] asserts that the probabihties of committing decision errors for the proposed testing 
plan can be adjusted below any prescribed level by choosing a sufficiently small value of C > 0. 
This process of adjusting the probabilities of committing decision errors can be accomplished by 
the bisection risk tuning techniques established in our earlier paper [^. Actually, we had derived 
in [4] many concrete testing plans by the same methodology as that of the general stopping and 
decision rules described by Theorem [5j To see the fundamental principle behind the construction 
of stopping and decision rules, we will demonstrate that the general problem of designing multi- 
stage plans for testing multiple hypotheses can be reformulated as the problem of constructing 
a sequential random interval to satisfy certain specifications of coverage probability. Note that 
the word "sequential" is used to indicate the fact that the random interval is constructed from 



samples of random size. As a consequence of the connection between testing plans and sequential 
random intervals, the properties of the testing plan described by statements (I)-(IX) of Theorem 
[5] can be justified by the general theory of coverage probability of sequential random intervals 
established in Sections 2.5 and 2.6 of [3]. 

To illustrate why the general hypothesis testing problem defined by ^ and ([7]) can be cast 
into the framework of constructing sequential random intervals with pre-specified coverage prob- 
abilities, consider a multistage sampling scheme with sample sizes rii < n2 < ■ ■ ■ < Us for random 
variable X parameterized by G 0. Let = ip{Xi, • • • , X^), where n is the number of samples 
at the termination of the process of drawing samples Xi, X2, • • • . Let ^{6, n) and '^{9, n) be 
functions of 9 and n such that the following requirements are satisfied: 

(i): The sequential random interval (^(0,n),'^(0,n)) has m possible realizations: (6',i,6'"_^^), i = 
0,1, ■•• ,m-l. 

(ii): 

Fv{^{9,n) <e <'2^(e,-a)\0}>l-Si (8) 

for any 9 £ 0i and i = 0, 1, • • • ,m — 1. 

Given that the sequential random interval {^{9, n), ^{9, n)) satisfying requirements (i)~(ii) 
is constructed, the risk requirement ([7]) can be satisfied by using the sequential random interval 
{^{9, n), '^{9, n)) to define a decision rule such that, for i = 0,1,- ■ ■ , m — 1, hypothesis J^ is 
accepted when the sequential random interval {^{9, n), '^{9, n)) takes the i-th realization at the 
termination of the sampling process. Our claim that the risk requirement d?]) is satisfied can be 
justified by virtue of ([8]) and the observation that 

{Accept ^} = {if (0, n) <e < ^{9, n)} (9) 

for any 9 £ 0i and i = 0,1,- ■ ■ ,m — 1. 

The construction of the sequential random interval {J2f{d,n),'W{0,n)) fulfilling requirements 
(i)-(ii) can be accomplished as follows. Noting that 

Pr{^(0,n) <9<'W(9,n) | 0} = Pr{(if (?, n), ^(0, n)) C(0^,0f^i) {9} 

for any 9 G 0i with i € {0, 1,---,?ti— 1}, we propose to control the coverage probability of the 
sequential random interval by creating a particular inclusion relationship between m sequences 
of confidence intervals and the desired sequential random interval. For i = 0, 1, • • • ,m — 1, let 
{91, 9"^^) be referred to as the i-th realization of the sequential random interval {^{9, n), ^2^(9, n)). 
For i = I,--- ,s, let 9i = (p^^ = ip{Xi,--- ,X„J as before. For i = 0,1,- ■ ■ ,m — 1, let 
{L{9i,ni,ai),U{9£,n£, (3i^i)), i = 1,- ■ ■ ,s he referred to as the i-th controlling confidence se- 
quence. Our purpose of introducing the term "controlling confidence sequence" are two folded: 
(i) To indicate that the confidence sequences are used to control the coverage probability of the 
desired sequential random interval; (ii) To avoid the confusion between the sequences of confidence 



intervals and the sequential random intervals. In this setting, we propose the following stopping 
and decision rule: 

Continue the sampling process if there exists no index i S {0, 1, • • • ,m — 1} such that the i-th 
controlling confidence sequence is included by the i-th realization of the sequential random interval. 
At the termination of the sampling process, make the following decision: 

(a): If there exists a unique index i E {0, 1, • • • ,771 — 1} such that the i-th controlling confidence 
sequence is included by the i-th realization of the sequential random interval, then designate the 
i-th realization as the outcome of the sequential random interval. 

(h): If there exist two consecutive indexes i — 1 and i in {!,■ ■ ■ ,m — 1} such that the (i — 1)- 
th and i-th controlling confidence sequences are included, respectively, by the {i — l)-th and i-th 
realizations of the sequential random interval, then designate either the {i — l)-th or i-th realization 
as the outcome of the sequential random interval based on an arbitrarily pre-specified policy. 

As a consequence of the assumption that all controlling confidence sequences always include 
Oi for £ = 1,- ■ ■ , s, it is impossible that there are more than two indexes i G {0, 1, • • • , m— 1} such 
that the i-th controlling confidence sequence is included by the i-th realization of the sequential 
random interval. However, the situation described in (b) is possible. One possible policy to 
handle such a tied situation for the i-th stage is to designate the (i — l)-th realization as the 
outcome of the sequential random interval if Og < mg^i and designate the i-th realization as the 
outcome of the sequential random interval if 6^ > rrn^i, where m^^j can be taken as ra^^i = ' ^ ' 
or rui^i = i[min{2; G /g : L{z,ni,ai) > 6'^} + maxjz G /g : U{z,ni,Pi) < 6'/}] for £ = 1, ■ ■ ■ ,s 
and i = 1,- ■ ■ ,m — 1. 

Based on the above stopping and decision rule, the decision variables can be expressed as in 
terms of Og. For this purpose, define 

£/i^i = {z e Ig^ : L{z,ne,ai) > 9-}, 
^i,i = {zeI^^:U{z,n,,f]i)<9'/}, 

{z G Ig^ : 9', < L{z,ne,a,) < U{z,ne,Pi) < 9'^}, 

if ^e^i + and ^£,i = 0, 

if^^,i = 
if ^^ j ^ and ^£,i = 0, 

if J2/^ • = 

for ^ = 1, ■ ■ ■ , s and i=l,---,77i — 1, where q^j is a number such that min'^ii < C£^i < max^^^j. 
In particular, we can take cg^i = 2(niin'^£ j + max'^£ j) or q.j = ^{^i + ^f) foi^ i = 1, • • • ,m — 1. 
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Define fi^m = oo and gifi = — oo. Then, the decision variables can be defined as 

{i if there exists i £ {1, ■ • • , fn} such that ge^i-i < 9^ < fii 
(10) 
otherwise 

for i = 1,- ■ ■ ,s. At the termination of the sampHng process, the (z— l)-th reahzation is designated 
as the outcome of the sequential random interval {^{6, n), ^(0, n)) if Di = i. 

Under the assumption that ai = 0(C) and /3j = 0(C) for i = 1, • • • ,m — 1 and that the maxi- 
mum sample size Ug is no less than the minimum integer n guaranteeing that {O'^ < L{(p^, n, ai) < 
U{iPn, n, j3i) < 6'/} is not an impossible event for i = 1, • • • , m — 1, we can show that the coverage 
requirement ([8]) can be satisfied with the above stopping and decision rule by choosing C > to 
be a sufficiently small number. 

Prom above discussion on the reformulation of testing procedures, it can be seen that the 
testing plan described in Theorem [5] is proposed by the same idea of constructing the sequential 
random interval as above. Accordingly, the properties of the testing plan can be shown by virtue 
of ([9]) and the theory of coverage probability of sequential random intervals established in Sections 
2.5 and 2.6 of [3]. Most importantly, such reformulation demonstrates that the two seemingly 
different fields of sequential analysis, multistage parameter estimation and hypothesis testing, can 
be cast into a much broader and unified framework of constructing sequential random intervals 
of pre-specified coverage probabilities. A critical methodology to make such unification possible 
is the use of multiple confidence sequences to control the coverage probability of the desired 
sequential random interval. The controllability of the coverage probability is due to the inclusion 
relationship implemented by the stopping and decision rules. In view of its versatility, we call 
such a methodology of using confidence sequences to define stopping and decision rules as the 
Inclusion Principle (IP). In addition to IP, the design of multistage testing plans and estimation 
procedures can be tackled by the same computational method - bisection coverage tuning, which 
is discussed in our papers [3] and [1]. 

Haven established the fundamental principle of constructing testing plans, our next task is to 
make the stopping and decision rules as simple as possible. In situations that the parameter 9 to 
be tested is the expectation of X, for the sake of simplicity, one can replace the strict confidence 
limits by their approximations derived from the central limit theorem or by their bounds derived 
from probabilistic inequalities such as Chernoff bound. In particular, we can use the following 
methods for constructing approximate confidence intervals. Assume that Xi, X2, • • • are identical 

— def Y^" X 

samples of X and that the variance of the sample mean X^ = '"^ — - is a bivariate function, 
denoted by i^{9, n), of 6 and n. If the sample size n is large, then the central limit theorem may 
be applied to establish the normal approximation 

F-^^ (z, 9) "^ Pr{X„ <z\e}^^ i^fff^ I , (11) 

G^^^ (z, 9) 1^' Pr{X„ >z\9}^^\ ^==^ 1 . (12) 
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Let 5 € (0,1). If Xn is a ULE of 9, then we can obtain lower and upper confidence limits 
respectively as 



L{Xn,n,5) = mi{e eQ:<^ 



and 



such that 



U{Xn,n,5) =sup<eee:$ 



( x» 



\vw, 



> 



> 



n 



Pr{L(X„, n,d)>9\e}^-, Pr{[/(X„, n, 5) < | 0} 



2 



and Pr{L(X„, n,6) < < U{Xn, n,6) \ 9} ^ 1 — 6 for ^ G 0. To improve the accuracy of normal 
approximation (jlip and (J12p . we propose to replace 6 in 'f{9,n) hy z + w{9 — z) with w G [0, 1]. 
That is, we suggest modifying (jlip and (J12p as follows: 



%„(^,^)«^ 



G^(z,9)^'^ 



^r{z + w{9-z),n)l ' 



Vr(z + u;(0-z),n) 



(13) 
(14) 



The new parameter w is introduced to improve the accuracy of approximation. Based on the 
new approximation (|13p and (J14p . we propose to obtain lower and upper confidence limits for 9 
respectively as 



and 



L{Xn,n, 6) = inf < 



U{Xn,n, 5) = sup < 



6* G e : $ 



9-Xr 



'riXn + wi 




£9:$ 



Xn-9 



'r{Xn + w{9-Xn),n) 



> 



(15) 



(16) 



so that Fr{L{Xn,n,6) > 9 \ 9} ^ ^, Pi{U{Xn,n,d) < 9 \ 9} ^ ^ and Fr{L{Xn,n,6) < 9 < 
U{Xn,n, 6) I 0} ~ 1 — (5 for 9 G Q. The confidence limits constructed by this approach can be used 
to derive simple stopping and decision rules. Moreover, the performance of the testing plans can be 
optimized with respect to w G [0, 1]. It should be noted that for parameters of binomial, Poisson, 
geometric and hypergeometric distributions, explicit forms of confidence limits can be obtained 
from ([T5|) and ([T6|) by solving quadratic equations. For example, in scenarios that Xi,- ■ ■ ,X„ are 
i.i.d. samples of Bernoulli random variable X such that Pr{X = 1} = 1 — Pr{X = 0} = p G (0, 1), 
the lower and upper confidence limits for p can be readily derived from (fT5|) and (fT6]) respectively 

by 



L{Xn,n,6) 



Xn + ^[l-2{l-w)Xn]-Z 



n ~^ \2n I 



1 + 



{wZ)"^ 
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and 

,,,^ ,, ^n + ^[1 - 2(1 - ^)X„] + Z^^^^ + {^f 

U{Xn,n,5) = 



where Z is the critical value such that ^{Z) = 1 



I I {^'zr 



2- 



6 Multistage Tests for Functions of Two Binomial Proportions 

Testing hypotheses on functions of two binomial proportions is particularly important in prospec- 
tive comparative studies such as randomized controlled clinical trial. More formally, let X and Y 
be independent Bernoulli random variables such that Pr{X = 1} = 1 — Pr{X = 0} = px & {0,1) 
and Pr{y = 1} = 1 — Pr{y = 0} = Py € (0, 1). Let g{., .) be a bivariate function of Px and 
Py. It is a frequent problem to test hypotheses regarding g{px,Py) based on samples of X and Y . 
Typical examples of g{px,Py) are g{px,Py) = Px - Py and g{px,Py) = ^, which are respectively 
referred to as the difference of population proportions and ratio of population proportions. In the 
sequel, we shall design multistage plans for testing hypotheses regarding g{px,Py) by virtue of 
the method proposed in Section 5. The main idea is to make use of confidence limits to define 
stopping and decision rules. To address the problems of testing hypotheses regarding g{px,Py) 
in a unified framework, we define 6 = g{px,Py) and consider the general problem of testing m 
mutually exclusive and exhaustive composite hypotheses defined by ^ and d?]) . For simplicity of 
notations, let p = {px,Py) and abbreviate g{px,Py) as g{p). 

Our strategy is to seek a class of testing plans such that the risk of making wrong decisions 
can be controlled by the risk tuning parameter C > 0, and then apply the risk tuning technique 
to adjust the risk level to satisfy the requirement ([7]). Let 0(C) denote the class of functions of 
the same order of C- Let Oj = 0(C) € (0,1) and /3j = 0(C) € (0,1) for z = 0, 1, • • • , ?n,. Let 
6q = —oo and 9'^ = oo as before. Specifically, we seek a class of sampling schemes associated with 
C satisfying the following requirements: 

(i) The sample sizes iVi,x(C) < A^2,x(C) < • • • < N,^x{C) and iVi,j,(C) < A^2,j/(C) < • • • < Ns,y{C) 
are decreasing functions of the risk tuning parameter C > 0. Namely, the sample sizes are 
increasing as C decreases. For simplicity of notations, we abbreviate Ni^xiO^ ^i,yiC) as 
^£,x, ]^e,y respectively and define 

Ki^x = / ^^ii ^i,y = / ^ ^j' Ki = {Ki^x,Ke^y), 



for 



Ni,. 


Nt.y 


L^-. 


Ki,y = Y.^r, 


i=l 


i=l 


Ki^x 


Ki,y 
rtn . — —. 



Pe,x — T7 — ! Pe,y — TT — ; Pe — \Pi,x^Pl,y> 

NLx Ni^y 



(ii) For £ = I,--- ,s, one-sided confidence intervals [/3(p^,^, Qj), oo) and (— oo, ^/(p^,^, /3j)] for 
9 = g(p) can be constructed in terms of estimators p^, sample sizes Ni^x, Xi^y and the risk 
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tuning parameter C > such that 

C{Pi,e,ai) < g{p(,), PT{£{p(,,£,ai) > 9 \ p} < at, 

u{p„ e, ft) > g{p,), ft{u{p„ i, Pi) <e\p}<pu 

for z = 0, 1, • • • ,m and p € {{px,Py) : Px € (0, 1), Py G (0, 1)}. 

(iii) The samphng process is continued until 9[_i < C{p£,£,ai^i) < U{pi,i, (3i) < 0" for some 
i € {!,■ ■ ■ , m} at some stage with index ^ € {1, • • • ,s}. 

(iv) When the sampUng process is terminated at the i-th stage, there are only two possible cases 
(A) and (B) described as follows: 

Case (A) : There is a unique index i G {!,••• ,m} such that {^j'_i < C{p£,i,ai-i) < 

u{p„e,Pi)<e':}. 

Case (B) : There is a unique index i £ {!,- ■ ■ ,m — 1} such that {6'i_^i < C{p£,i,ai-i) < 
U{p,J,Pi) < 0'^} and {0'i < C{p„£,ai) < U{pe,e,Pi+i) < ^f+J- 
The decision is made based on the following policy: 

In Case (A), accept hypothesis J^i-i. In Case (B), accept J^i-i if g{p(^) < ' ^ ' ^^^^ 
accept J^i otherwise. 

(v) The sampling process is guaranteed to be terminated at or before the s-th stage. 

Within this class of sampling schemes, we can apply the bisection risk tuning technique to 
determine (" > as large as possible such that the risk requirement ([7]) is satisfied. In practices, 
the inequalities "Pr{>C(p£,£, a^) > \ p} < Oj" and "Fi:{U{p^,i, (3i) < ^ | p} < ft" in requirement 
(ii) can be respectively relaxed as 

Pr{/:(p„ i, ai)>0\p}^ ai, Pt{U{p„ i, ft) < | p} « ft 

provided that the risk of making wrong decisions can be controlled by C- The constraints 
''C{pi,£,ai) < giPi)'' and ''U{p£,£, (3i) > g{pe)" in requirement (ii) ensure that "Case (A)" 
and "Case (B)" are the only two possible cases at the termination of the sampling process. 

As an illustration of the construction of sampling schemes, we consider the problem defined 
by (HI) and ([7]) of testing m mutually exclusive and exhaustive composite hypotheses regarding 
= g{p) = Px — Py At each stage, one can use the following confidence limits 

for £ = !,••• ,s, where ^ and '^ are confidence limits investigated in [14j . which can be expressed 
as 



Zx(l - Ix) , Uy{l - Uy) 



■^{Px,Py, Nx,Ny, 5) =Px-py- Zs\ + 



Nx Ny 



Ux{l -Ux) , ly(l -ly) 



^{Px,Py, Nx,Ny, 6) = Px -Py + 2sJ r^^ h 



Nx Ny 
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where 



A; , „ . ,,1 ^ ( k 



I5x G <; ^ : A: = 0, 1, • • • , iV^ ^ Py e -i — : k = 0,1,- ■■ ,Ny 



^ c + 2N^p^ - v^c^ + AcN^p^jl - % 

2(c + iV,.) 



c + 2N^p^ 


+ Vc2 + 4ciV,p,(l 


-Px) 




2(c + Ar,) 




C + 2NyPy 


+ V^c2 + 4ciVJ^PJ^(l- 


-Py) 



_ c + 2iV^p^ - yjc'^ + 4ciV^p^(l - Py) 

with c = -E|. It can be checked that Ix-, Ux are the roots for p in the quadratic equation 
IPx — p| = 2sy^p{l — p)/Nx and that ly, Uy are the roots for p in the quadratic equation \py—p\ = 
'2'5-\/p(l ~ p)/^y Actually, there are many methods to construct confidence intervals satisfying 
requirement (i) (see, e.g., [Hill] and the references therein). 

For a given value of the risk tuning parameter ^, we have to determine sample sizes {Ng^xT^i,y)j 
£=!,■■■ , s such that the sampling process is guaranteed to be terminated at or before the s-th 
stage. This can be done as follows. 

In applications, the sample sizes for Y can be chosen to be increasing functions of sample 
sizes of X. In other words, one can define an increasing function F(.) such that Ni^y = F{Ni^x) 
for i = 1, • • • ,s. Therefore, for a given value of the risk tuning parameter ^, the maximum 
sample sizes {Ns^x, ^s,y) can be determined as {Nx, Ny), where Ny = r{Nx) with Nx guaranteeing 
that {iPx,Py) ■ O'i < .^iPx,Py,Nx,Ny,ai) < '^{px,Py,Nx,Ny,l3i) < 9'-} is nonempty for i = 
1, • • • ,m — 1. The minimum sample sizes (A'^i,x;-^i,j/) can be determined as {Nx,Ny), where 
Ny = r(Nx) with Nx guaranteeing that 

m 

\J{{Px,Py) : ^--i < ^{Px,Py,Nx,Ny,ai^i) < ^{px,Py,Nx,Ny,P,) < e'l} 

i=X 

is nonempty. To reduce the sampling cost, one should choose Nx as small as possible in the 
determination of [Ns^x^Ng^y) and (iVi^a;, A^i^^). Once the sample sizes for the first and last stages 
are determined, one can choose the sample sizes for other stages. For example, the sample sizes 
can be chosen as arithmetic or geometric progressions. 

Given that a class of testing plans associated with C, have been constructed such that the risk 
of making wrong decisions can be made arbitrarily small if C, is sufficiently small, the bisection risk 
tuning technique can be applied to determine Q as large as possible such that the risk requirement 
d?]) can be satisfied. A critical subroutine of the bisection risk tuning method is to apply Adapted 
Branch and Bound algorithm proposed in Appendix T of [3] to determine whether the probability 
of rejecting J^ is no greater than 5i for all p such that 9 = g{p) € 0j. This requires bounding 
the probability PrjRejecting ^ | p} for p in a rectangular domain Q = {{px,Py) '■ < p < 
Px I^Px — ^1 < p < Pj/ < Py < 1}. From our proposed method of designing multistage testing 
plans, it can be seen that, for a given value of risking tuning parameter C > 0, a testing plan can 
be described as follows: 
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At the £-th stage, the samphng process is continued if p^ € '^i. Otherwise, the samphng process 
is terminated and hypothesis ^ is accepted if pi G Sli^i. Here "^i and ^j/, i = 0, 1, ■ ■ ■ ,7n — 1 
are mutuahy exclusive and exhaustive subsets of the support of p^. In other words, the samphng 
plan is to continue sampling until p^ G ^j^^ for some £ G {1, • • • , s} and iG{0,l,--- ,?T2 — 1} and 
then accept hypothesis J^. 

With the above understanding of the testing plan, one can see that for i = 0, 1, • • • , m — 1, 



where 



{Rejecting J^} = U^^^i {p^ G "T^, 1 < £ < j; p^ G ^i;J 

u ^^.. 



(17) 



% 



def 



0<fc<m 



is the rejection region of hypothesis J^. To reduce the computational complexity for bounding 
PrjReJecting J^ | p}, we invoke the truncation method proposed in [5]. For this purpose, let 
r\ G (0, 1) and define 

Q,i; =Tib(p^,A'"£,a:,77), df,;j; =Tub(p^,iV<?,:r,?7), Q,j; = Tib(pj^,A'"f,y,??), d£^.y == Tubby, iV£,j;,?7) 
for i = 1,- ■ ■ ,s, where Tib(-, •, •) and Tub(-, •, ■) are multivariate functions such that 



Tib (6', n, -q) = max < 0, — 

n 



1- 


_20 /i + iiii^a^ 


1- 

71 fl 1 


2,3 

3n "^ In 1 


_20+^l + i«fi4^ 




2,3 
3« + lnl 



Tub(^,"-, ??) = min ■( 1, 



for 6 G [0,1], r] G (0,1) and n G N. By virtue of (J17p . Theorem 3 of [5], and Bonferroni's 
inequality, we have 

PrjRcjecting J^ | p} 
< 2sri + Pi-{ae.x < Pg.x < be.x, ae^y < pgy < be.y, £ = !,■■■ ,s; Rejecting Jf^ \ p} 



'^sr] + ^Pr{ai,x < Pg^^ < ^f,^;, ai^y < Pfy < bg^y, £ = 1, 



j=i 



P^e'^^, £ = !,■■■ ,j- 1; P, e &l. \ p} 



and 



< 2s77 + ^ Pr{af,2; < Pi^^ < ^t,x, ai^y < p^y < be,y, (. = !,■ 
Pf^(^%, i=l,--- ,j- 1; Pj e ^l^ I p} 



PrjRejecting Jfi \ p} > ^ Pr{Q,:r < Pe,x < di^x, ci^y < p^y < d(,y, £ = !,■■■ , j; 

i=i 

Pie%, £ = !,■■■ ,j-l; Pj e &Ij I p} 



(18) 



(19) 
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for all p £ Q and i = 0,1,- ■ ■ ,m — 1. It can be seen that the bounds of PrjRejecting ^ | p} in 
([18]) and (fT9l) can be expressed as summations of probabihstic terms hke Pr{/^j G J^i, I < i < 
£ I p}, i = !,••• ,s, where J^£ does not depend on p and is a subset of the support of Ki for 
£ = 1, • • • , s. Since ^ is independent of p, we can employ the recursive technique for bounding 
Prji^j G ^, 1 < i < ^ I p}, i = 1,- ■ ■ ,s over Q, which has been estabhshed in [H]- 

Given that the Adapted Branch and Bound algorithm can be used to determine whether 
PrjRejecting J^ | p} is no greater 6i for p G {(px^Py) ■ < p^ < 1, < Py < 1, g{px,Py) € ©J, 
the bisection risk tuning technique proposed in [3l 1^ can be applied to determine C, as large as 
possible such that the risk requirement ([7]) can be satisfied. 

7 Conclusion 

In this paper, we have developed a general approach for testing composite parametric hypothe- 
ses. We demonstrate that the problem of testing multiple hypotheses can be cast into a unified 
framework of constructing sequential random intervals with prescribed coverage probabilities. We 
propose inclusion principle for constructing multistage testing plans. Instead of using likelihood 
ratio to define testing plans, we use confidence limits to make early stopping possible. For a wide 
class problems of testing population means, the sample sizes of our proposed testing plans are ab- 
solutely bounded. Moreover, our testing plans are more efficient over a wide range of parametric 
values as compared to SPRT and its variations. Our approach can be applied to test arbitrary 
number of composite hypotheses. 

A Preliminary Results 

We need some preliminary results. 

Lemma 1 Let S he an event determined by random, tuple (Xi, ■ ■ ■ ,Xj.). Let '^{Xi^ ■ ■ ■ ,Xr) he a 
ULEofO. Then, 

(i) Vr{S' I 6} is non-increasing with respect to 9 £ Q no less than z provided that S C 

MXi,---,Xr)<z}. 

(ii) Prjff I ^} is non- decreasing with respect to 6 £ Q no greater than z provided that S C 

{(^(Xi,---,Xr)>4- 

Proof. We first consider the case that Xi,X2, • • • are discrete random variables. Let Ly- denote 
the support of r, i.e., Ir = {r(a;) : a; G 0}. Define ,^r = {{Xiioj)-, ■ ■ ■ ,Xr{ijj)) : a; G (f , Y{bj) = r} 
for r £ Lj.. Then, 

Pr{^|^}=^ ^ FT{X,=Xi, i = !,■■■ ,r\ 9}. (20) 

r£lr (xi,--- ,Xr)eS'r 



17 



To show statement (i), using the assumption that S C {ip[Xi,--- ^X^.) < z}, we have 
yp{xi, ■ ■ ■ ,Xr) < z for {xi, ■ ■ ■ ,Xr) € ^r with r € /r- Since f{Xi, ■ ■ ■ ,Xr) is a ULE of 9, we 
have that Pr{Xj = Xi, i = 1, ■ ■ ■ ,r \ 9} is non-increasing with respect to G no less than z. It 
follows immediately from (j20p that statement (i) is true. 

To show statement (ii), using the assumption that <# C {ip(Xi,--- ,Xj.) > z}, we have 
ip{xi,--- ,Xr) > z for {xi,--- ,Xr) G -^r with r € /r- Since 93(Xi,--- ,Xr) is a ULE of 0, we 
have that Pr{Xj = Xj, i = 1, • • • ,r | 0} is non-decreasing with respect to G no greater than 
z. It follows immediately from (j20p that statement (ii) is true. 

For the case that Xi,X2, • • • are continuous random variables, we can also show the lemma 
by modifying the argument for the discrete case. Specially, the summation of likelihood function 
Pr{Xj = Xi, i = 1,- ■ ■ ,r \ 9} over the set of tuple (xi, • • • ,Xr) is replaced by the integration of 
the joint probability density function fxi,---,Xr{xir " iXr,9) over the set of (xi,--- ,Xr)- This 
concludes the proof of Lemma [TJ 

D 

Lemma 2 Pr {Fz{Z) < a} < a and Pr{Gz{Z) < a} < a for any random variable Z and positive 
number a, where Fz{z) = Pi{Z < z} and Gz{z) = Vx{Z > z}. 

Actually, Lemma[2]is a well-known result, which can be proved by the following argument. Let 
Iz denote the support of Z. If {z G Iz '■ Fz{z) < a} is empty, then, {Fz{Z) < a} is an impossible 
event and thus Vx{Fz{Z) < a} = 0. Otherwise, we can define z* = max{z G Iz ■ Fz{z) < a}. It 
follows from the definition of z* that Fz{z*) < a. Since Fz{z) is non-decreasing with respect to 
z, we have {FziZ) < a} = {Z < z*}. Therefore, Pr{F^(Z) < a} = Fr{Z < z*} = Fz{z*) < a 
for any a > 0. By a similar method, one can show Pj:{Gz{Z) < a} < a for any a > 0. 

Lemma 3 Let X be a random variable parameterized by its mean E[X] = G 0. Suppose that 
X is a ULE of 9. Let Xn = — -, where Xi, ■ ■ ■ ,X„ are i.i.d. samples of random variable X. 

Then, 

Pr{X„ <z}< ["^(z, 9)r, \/z<9 
Pr{X„ >z}< ["^(z, 9)f, Vz > 9. 

Moreover, ^{z, 9) is non- decreasing with respect to 9 no greater than z and is non-increasing with 
respect to 9 no less than z. Similarly, ^{z,9) is non- decreasing with respect to z no greater than 
9 and is non-increasing with respect to z no less than 9. 

Proof. By the convexity of function e^' and Jensen's inequality, we have infp>oE[e''*-"''"~^'] > 
infp>o e'''^[^~^l > 1 for 61 > z. In view of infp<oE[e''(^~^)] < 1, we have "^(z, 0) = infp<oE[e''(-'^-^)] 
for 9 > z. Clearly, 'if{z,9) = infp<oe~''^E[e'' ] is non-decreasing with respect to z less than 9. 
Since X is a ULE of 9, we have that E[e''(^-^)] = e-P^E[eP^] = e"^^ /^"^g Pr{e^^ > u}du is 
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non-increasing with respect to 9 > z for p < and thus ^{z, 6) is non-increasing with respect to 
greater than z. 

Observing that infp>oE[e''(^"^)] < 1 and that infp<oE[e''(^-^)] > infp<o e/''^[^-^l > 1 for 
6* < z, we have '^(z,6l) = infp>oE[e^(^-^)] for 9 < z. Clearly, '^(z,6l) = infp>o e-^^E[e^^] is 
non-increasing with respect to z greater than 6. Since X is a ULE of 6, we have that E[e'''-*''"~^-'] = 
e"''^ /^o Pr{e''"^ > u}du is non-decreasing with respect to 6* for /? > and consequently, '^{z, 9) 
is non-decreasing with respect to 9 smaller than z. 

Making use of the established fact infp<o ^[eP^-^~^'] = "^{z^ 9) and the Chernoff bound Pr{X.„ < 
z} < [infp<oE[e/'(-'^-^)]]" (see, [2]), we have Pr{X„ < z} < ['rf{z,9)]'' for z < 9. Making 
use of the established fact infp>oE[e''(''^~^)] = '^{z,9) and the Chernoff bound Pr{X„ > z} < 
[infp>oE[e/'(^-^)]]", we have Pr{X„ > z} < ['<^{z,9)f for z > 9. This concludes the proof of 
Lemma [3j □ 



B Proof of Theorem 1 

Since Xn is a ULE for 0, it follows from Lemma[I]that Gj^ {z, 9) is non-decreasing with respect to 
9 eQ. Hence, by the definition of the lower confidence hmit, L(^2±^, Ca) > 6*0 if GyJ^^,9o) < 
(^a. Since Xn is a ULE for 9, it follows from Lemma [3] that 



a 



Xr 



9o + 9i 



Pr^X„>'^ + '° 



if 



n > 



2 

In(Ca) 



< 



^ 



h + 01 



< Qa 



ln<r(^,0o)' 



(21) 



Observing that G^ (-^j^) is non-increasing with respect to z, we have that L{z,C,ol) is non- 
decreasing with respect to z. This implies that L{z, Qa) > L(^a±^, (a) > 6^0 for z > ^^Mi if ([^ 
holds. 

On the other hand, since Xn is a ULE for 9, it follows from Lemma [1] that F^ {z,9) is non- 
increasing with respect to 9. Hence, by the definition of the upper confidence limit, U{ °"^ ^ ,C/^) ^ 
9i if F-^^{^^,9i) < C/3. Since X„ is a ULE for 9, it follows from LemmaSthat 



if 



^x 



+»i 



PriX„<<" + 



n > 



2 

ln(C/3) 



< 



'^ 



h + Oi 



<C/3 



ln<r(^,^i^ 



(22) 



Observing that F^ {z, 9) is non-decreasing with respect to z, we have that U{z, <^a) is non- 
decreasing with respect to z. This implies that U{z, C/3) < ^(^a+^, C/3) < 9i for z < ^a+^ if (|22]) 
holds. 
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Let rimax be the smallest integer n such that {L{Xn, C<^) < do < Oi < U{Xn, CP)} = 0- From 
the above analysis, we know that either U{z, C,f3) < 9i or L{z^ C,a) > Oq must be true for any 
z € I-j^ if (pTj) and ([22]) are satisfied. This implies that 



ln(C«) ln(C/3) 

ln'^(^,0o)' ln<^(^,ei) 



n < n^^ < max<; ^ ^^^,^^^^ . ^,^0+^, .^ 1^ ' (^3) 



By the definition of the testing plan, we have 

PrjReject J^o \ Oq} < ^ Pr{L(X„, (a) > 9o, n = n\ Oq} 



n=l 

^max 

< ^Pr{L(X„,Ca)>0o|^o} 

n=l 

< nmax C" 



and 

^inax 

Pr{ Accept J% \ Oi} < ^ Pr{C/(X„, C/3) < 6*1, n = n | 6*1} 

n=l 

^max 

< J^Pr{C/(X„,C/3)<^i|^i} 

n=l 

< nmax C/3- 

Making use of (f23]) . we have that both nmaxCo^ and nmaxC/3 tend to as C — >■ 0. It follows that 
PrjReject J^q \ Oq} and Pr{ Accept J% \ 9i} tend to as C —>^ 0. From the definition of the testing 
plan, we have that {Reject J^q} and {Accept J^q} are determined by random tuple {Xi, ■ ■ ■ , X^)- 
Moreover, {Reject J^o} C {Xn > ^0} and {Accept J^q} C {X„ < 6*1}. Hence, by Lemma [U we 
have that Pr{Accept J^q \ 6} is non-increasing with respect to ^ G G such that 6 ^ {60,61). This 
completes the proof of the theorem. 

C Proof of Theorem 2 

By the definition of the lower confidence limit, \^['rf{Xn,C{Xn,S))]"' < 5, C{Xn,S) < X„} is a 
sure event. It follows from Lemmas [2] and [3] that 

Pr{£(X„,5) >9} = PY{['^(X,,,C(Xn,SW <S,e< £(X„,(5) <X„} 

< Pr{GxJXn,C(Xn,6)) <6, 9< C(Xn,S) < X„} < Pr{G^JX„,0) < 5} < 5 

for 6* € e. By the definition of the upper confidence limit, {['r(X„,Z^(X„, 5))]" < 6, U(Xn,S) > X„} 
is a sure event. It follows from Lemmas [2] and [3] that 

Pr{U(Xn,5) <e} = Pr{[^(XnMXn,S))r < 6, Xn<U(Xn,S) < 0} 

< PT{Fj^jX„,U(Xn,S)) < 6, Xn<U{Xn,5) <e]< Pr{i^^jX„,0) < 5] < 5 
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for dee. 

Now, we show that C{z, 6) is non-decreasing with respect to z. Let zi < Z2 and Li = 
C{zi,6), L2 = C{z2, 5). To show the monotonicity of C{z, 6), it suffices to show L2 > Li for three 
cases (not necessarily mutually exclusive) as follows: 

Case(i): ["^(^2, ^2)]" < <^; 

Case (ii): ["^(^1,^1)]" < <5; 

Case (iii): [^(zi,zi)]" > 6 and [^(^2,^2)]" > S. 

In Case (i), we have L2 = Z2 > zi > Li. 

In Case (ii), we have Li = zi and ^{z2,zi)\^ < ["(^(zi, zi)]" < 5, which implies that L2 > 
zi = Li. 

In Case (iii), it must be true that ["^(zi, Li)]" = 6 and [^(22,-^2)]" = 5. Suppose, to get a 
contradiction, that Li > L2. Then, zi > Li > L2 and ^to{zi,L2) < 'rf{zi,Li) = ^{z2,L2), since 
^{z,6) is non-decreasing with respect to 6 < z. On the other hand, 'rf{zi,L2) > '^{z2,L2), since 
Z2 > zi > L2 and ^{z, 6) is non-increasing with respect to z > 0. This is a contradiction. Thus, it 
must be true that Li < L2- This completes the proof that C{z,6) is non-decreasing with respect 
to z. 

In a similar manner, we can show that U{z, 6) is non-decreasing with respect to z. This 
completes the proof of the theorem. 

D Proof of Theorem 3 



Since Xn is a ULE for 9, it follows from Lemma [3] that C{z, 6) is non-decreasing with respect 
to 9 no greater than z. Hence, it follows from the definition of the lower confidence limit that 



^ 



h + Oi 



Oo 



which is guaranteed by 



n > 



<Ca, 
In(Ca) 



< 



^0 + ^1 



ln<r(^,0o)' 



(24) 



By Theorem[2l >C(z, ("a) is non-decreasing with respect to z. Hence, >C(z, Ca) > £( "g ,Cq) > ^0 



for 2 > 



6*0 +6*1 



if (IMD holds. 



On the other hand, since Xn is a ULE for 9, it follows from Lemma [3] that C{z, 9) is non- 
increasing with respect to 9 no less than z. Hence, it follows from the definition of the upper 
confidence limit that U{^^XP) < ^1 if 



which is ensured by 



^(^^,^1 



n > 



<C/3, 
ln(C/3) 



> 



% + 9i 



ln^(^ 



(25) 



2— ^'^l; 
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By Theorem [21 U{z,C,fi) is non-decreasing with respect to z. Consequently, 



for z < ^aMi if ([25]) holds. This shows that either C{Xn,Ca) > Oq or U{Xn,C,P) < Oi must be 

true if 

In(Ca) ^ ln(C/3) 



ln<^(^,eo)' "ln<^(^,ei)' 

Let nmax be the smallest integer n such that {C{Xn,Coi) < Oo < di < U{Xn,Cl^)} = 0- Then, 

n<n , nia:: f ^"^^"^ ^n(C^^l 

n<n_<maxj^^^^^^^^^, j^^^^^^^^^j- (26) 

By the definition of the test plan, we have 

PrjRcjcct J^o \0o}<J2 Pi-{^(^". C«) > 00, n = 71 I Bo} < ^ Pr{/:(X„, (a) > ^o | ^o} < ^max C" 

n—l n—1 

and 

"max "max 

Pr{ Accept ^o I ^i} < £ Pr{^(^", C/?) < ^i, n = n | 0i} < ^ Pr{Z^(X„, C/3) < 0i \ 9i} < n„,ax C/?- 

71—1 n—l 

Making use of ([26]) . we have that both rima.xC'^ and nmaxC/5 tend to as C —> 0. It follows that 
PrjReject J^q \ 9o} and Pr{ Accept J% \ Oi} tend to as C ^ 0. 

From the definition of the testing plan, we have that {Reject rJ^} and {Accept J^} are deter- 
mined by random tuple {Xi, • • • ,Xn)- Moreover, {Reject J%} C {X„, > 6q} and {Accept J%} C 
{Xn ^ ^i}- Hence, by Lemma [H we have that Pr{ Accept J^ \ 9} is non-increasing with respect 
to € G such that 9 ^ (^O) ^i)- This completes the proof of the theorem. 
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